Classification and Powerlaws: The Logarithmic Transformation
Abstract
Logarithmic transformation of the data has been recommended in the literature for highly skewed distributions such as those commonly found in information science. The purpose of the transformation is to make the data conform to the lognormal law of error for inferential purposes. How does this transformation affect the analysis? We factor analyze and visualize the citation environment of the Journal of the American Chemical Society (JACS) before and after a logarithmic transformation. The transformation strongly reduces the variance necessary for classificatory purposes and is therefore counterproductive for descriptive statistics. We recommend against the logarithmic transformation when sets cannot be defined unambiguously. The intellectual organization of the sciences is reflected in the curvilinear parts of the citation distributions, while negative power laws fit the tails of the distributions excellently.
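The variance reduction described in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the citation counts are invented, the transformation is taken to be log10(x + 1) (the abstract does not state a base or an offset), and the coefficient of variation stands in for "variance available for classification". It does not reproduce the factor analysis of the JACS citation environment.

```python
import math
import statistics

# Hypothetical citation counts for ten journals (invented values, not JACS data).
citations = [1, 2, 2, 3, 5, 8, 20, 75, 400, 3000]


def log_transform(values):
    """Return log10(x + 1) for each value; the +1 offset (an assumption) keeps zero counts defined."""
    return [math.log10(v + 1) for v in values]


def coefficient_of_variation(values):
    """Sample standard deviation divided by the mean, a scale-free measure of spread."""
    return statistics.stdev(values) / statistics.mean(values)


transformed = log_transform(citations)
cv_raw = coefficient_of_variation(citations)
cv_log = coefficient_of_variation(transformed)

# The skewed raw counts show a much larger relative spread than the log counts.
print(f"relative spread, raw counts: {cv_raw:.2f}")
print(f"relative spread, log counts: {cv_log:.2f}")
```

On skewed data like this, the few very large counts that dominate the raw spread are pulled toward the rest after the transformation, which is the effect the abstract argues is counterproductive when that spread is what separates the groups.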
Similar resources
Two feature transformation methods based on genetic algorithms for reducing the classification error of support vector machines
Discriminative methods are used to increase pattern recognition and classification accuracy. These methods can be used as discriminant transformations applied to features or as discriminative learning algorithms for the classifiers. Usually, the criteria for discriminative transformations differ from the criteria for training discriminant classifiers or from their error. In this ...
Classification of Endometrial Images for Aiding the Diagnosis of Hyperplasia Using Logarithmic Gabor Wavelet
Introduction: The process of discriminating between benign and malignant hyperplasia began with subjective methods using light microscopy and is now being continued with computerized morphometrical analysis requiring some features. One of the main features, called Volume Percentage of Stroma (VPS), is obtained by calculating the percentage of stroma texture. Currently, this feature is calculated ...
Analytical Solutions for Spatially Variable Transport-Dispersion of Non-Conservative Pollutants
Analytical solutions have been obtained for both conservative and non-conservative forms of one-dimensional transport and transport-dispersion equations applicable for pollution as a result of a non-conservative pollutant-disposal in an open channel with linear spatially varying transport velocity and nonlinear spatially varying dispersion coefficient on account of a steady unpolluted lateral i...
Correction: Do the Rich Get Richer? An Empirical Analysis of the Bitcoin Transaction Network
1. Kondor D, Pósfai M, Csabai I, Vattay G (2014) Do the Rich Get Richer? An Empirical Analysis of the Bitcoin Transaction Network. PLoS ONE 9(2): e86197. doi:10.1371/journal.pone.0086197 Figure 10. Change of balances in one month windows. Increase (top) and decrease (bottom, vertical axis is inverted) of node balances in one month windows as a function of their balance at the beginning of each ...
Estimation of Genetic Trends for Test-Day Milk Yield by the Logarithmic Form of Wood Function Using a Random Regression Model
Estimation of genetic trends is necessary to monitor and evaluate selection programs. The objective of this study was to estimate the genetic trends for milk yield in Iranian Holstein cows using a random regression test-day model. The data set consisted of 743205 test-day records from 1991 to 2008, which were collected by the Animal Breeding Centre of Iran. Breeding, environmental and phenotypic...
Journal title: JASIST
Volume: 57
Publication date: 2006